Goto

Collaborating Authors

 absolute value


fd8872fcba4ba87312cdfe5ebba91ca9-Supplemental-Conference.pdf

Neural Information Processing Systems

The appendix includes the missing proofs, detailed discussions of some argument in the main body483 and more numerical experiments. We organize the appendix as follows:484 The proof of infeasibility condition (Theorem 3.2) is provided in Section B.485 Explanations on conditions derived in Theorem 3.2 are included in Section C.486 The proof of properties of the proposed model (r)LogSpecT (Proposition 3.4 & 3.6) is given487 in Section D and some additional properties are discussed.488 The truncated Hausdorff distance based proof details of Theorem 4.1 and Corollary 4.4 are489 given in Section E.490 Details of L-ADMM and its convergence analysis are in Section F.491 Additional experiments and discussions on synthetic data are included in Section G.492 Since the linear system (4) has no solution, we know from Farkas' lemma that the following system494 Hence, S is also a solution to (13). However, (13) does not have a solution. We can conclude that504 rSpecT is infeasible in this case.505


A Organization of the Appendix 482 The appendix includes the missing proofs, detailed discussions of some argument in the main body

Neural Information Processing Systems

The proof of infeasibility condition (Theorem 3.2) is provided in Section B. Explanations on conditions derived in Theorem 3.2 are included in Section C. The proof of properties of the proposed model (r)LogSpecT (Proposition 3.4 The truncated Hausdorff distance based proof details of Theorem 4.1 and Corollary 4.4 are Details of L-ADMM and its convergence analysis are in Section F. Additional experiments and discussions on synthetic data are included in Section G. ( m 1) Again, from Farkas' lemma, this implies that the following linear system does not have a solution: Example 3.1 we know ฮด = 2|h Since the constraint set S is a cone, it follows that for all ฮณ > 0, ฮณ S = S . Opt(C, ฮฑ) = ฮฑ Opt(C, 1), which completes the proof. The proof will be conducted by constructing a feasible solution for rLogSpecT. Since the LogSpecT is a convex problem and Slater's condition holds, the KKT conditions We show that it is feasible for rLogSpecT. R, its epigraph is defined as epi f: = {( x, y) | y f ( x) }. Before presenting the proof, we first introduce the following lemma.


Average Case Column Subset Selection for Entrywise $\ell_1$-Norm Loss

Neural Information Processing Systems

Nevertheless, we show that under certain minimal and realistic distributional settings, it is possible to obtain a (1+ null)-approximation with a nearly linear running time and poly (k/null) + O ( k log n) columns. Namely, we show that if the input matrix A has the form A = B + E, where B is an arbitrary rank-k matrix, and E is a matrix with i.i.d.





Minimal Variance Sampling in Stochastic Gradient Boosting

Neural Information Processing Systems

Differentsamplingapproaches were proposed, where probabilities are not uniform, and it is not currently clear which approach is the most effective. In this paper, we formulate the problem of randomization in SGB in terms of optimization of sampling probabilities to maximize the estimation accuracy of split scoring used to train decision trees.


1 EmbeddingMethodsinMotivatingCaseStudy

Neural Information Processing Systems

Isomap is a nonlinear dimensionality reduction method and finds low-dimensional embedding of high-dimensional data by preserving the pairwise geodesic distances between data pointsinmanifold. In2-dimensional embedding manifoldM,thegeodesic polygonal curvePi,j canbeprojected on the straight line connected it two endpoints. Every line segment ofPi,j has a corresponding line segmentinthethestraightline. The hyper-parameters searched over include the dimension of node representation as well as hyper-parameters specific to each model.